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' This work considers the allocation problem for multivariate stratified random sampling 

^ i . as a problem of integer non-linear stochastic multiobjective mathematical programming. 

With this goal in mind the asymptotic distribution of the vector of sample variances is 
studied. Two alternative approaches are suggested for solving the allocation problem 
for multivariate stratified random sampling. An example is presented by applying the 
j. . different proposed techniques. 
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1 Introduction 



The theory of probabilistic sampling is one of the topics of statistical theory that is most 
commonly used in many fields of scientific investigation. An effective survey technique in 
a population can be seen as an appropriate extraction of useful data which provides mean- 
ingful knowledge of the important aspects of the population. Stratified random sampling is 
one of the classical methods of the theory of probabilistic sampling for obtaining such infor- 
mation. The computation of the stratum sample size in stratified random sampling can be 
computed by diverse approaches, but optimum allocation, according to some criteria, has 
been found to be a useful approach. Optimum allocation has been stated as a non-linear 
mathematical programming problem in which the objective function is the variance subject 
to a cost restrict ion, or vice v ersa. Typically, thi s problem has been solved by using the 
Cauchy-Sch warz (Stuart . 19541 ) ineq uality, cited in ICochranl (jl977h or Lagrange's multiplier 



method, see lSukhatme et al. 



(|1984h 



Classical survey theory considers a single decision variable or characteristic; for exam- 
ple, in our case, univariate stratifi ed random samp l ing studies one character istic, the sample 
size and its strata allocation, see ICochranl rtl977h . ISukhatme et al. I rtl984T ) and iThompsonl 
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(Il997h . Moreover, in the context of stratified random sampling, diverse multivariate ap- 
proaches have been proposed whereby t he sample size and its a llocat ion within strata take 
i nto a ccount several characteristics, see lSukhatme et al. I (119841') and lArthanari and Dodge 
(|l98ll ). A detailed study of this problem is given by iDiaz-Garcia and Ulloal (|2008l ) . 

In univariate and multivariate stratified random sampling, when the optimum allocation 
is performed, and the cost function is the objective function, subject to certain variance 
restrictions in the different characteristics, then the problem can be reduced to a classi- 
cal mathematic al programming problem, and for t his pu rpose there are two well-known 
approaches: see ICodu ^| fj977h . ISm^tmT^TZl ffl) for the univariate case and see 
Arthanari and Dodgel (198ll) for the multiv aria te one, both fro m a deterministic point of 



view: aud see lpfaz-Garc"d Garavl and&S for a stochastic approach, 

respectively. Observe that, in the latter case (stochastic multivariate case with the cost func- 
tion as the objective function) the problem can be solved by using any of the techniques 
presented in IDiaz-Garcia and Garayl (|2007l ) . among many others. 

Alternatively, if the interest is to minimise the variances subject to a cost function, 
or to a given samp le size, then se ve ral approaches ca n be adopted to solve t his, f or the 



univariate case see 



sec 



Cochran! (Il977h. iThompsonl (119971) and ISukhatme et aL~\ (|l984h : and 



Sukhatme et al. 



(1984) and Dfaz-Garcia and Ulloa ( 20081 ) for the multivariate 



case. 



Dfaz-Garcia and Ulloa ( 20081 ) show that all the previously published approaches in this 



area are particular cases of the multiobjective mathematical programming technique, they 
proposed a unified theory for solving the problem of optimum al l ocatio n in multivariate 
stratified random sampling. Furthermore, Dfaz-Garcia and Ulloa ( 20081 ) propose the op- 
timum allocation in multivariate stratified random sampling as a nonlinear problem of 
integer matrix mathematical programming constrained by a cost function or by a given 
sample size. Also, by defining a particular vectorial function of the objective function of 
the matrix mathematical programming problem, they show that the optimum allocation in 
multivariate stratified random sampling also can be studied as a non-linear multiobjective 
integer mathematical programming problem. 

A topic with very little work done is the problem of optimum allocation in multivariate 
stratified random sampling when it is assumed that the variances in each stratum are 
unknown (the most common case i n practice) , in whic h cas e it would be needed to estimate 
these variances, see Cochran (1977), Thompson ( 1997 ) and Sukhatme et al. ( 19841 ). Under 
this context, the set of estimated variances for each stratum are random variables. This last 
situation has an important consequence when the optimum allocation problem in stratified 
random sampling is stated as a nonlinea r mathematical programming. The univariate case 
for this setting was studied in deta il by IDiaz-Garcia and Garayl (|2007l ) . The present work 
owes some ideas to iMelaku I dl986l ) who studies the asymptotic normality of the optimal 
solution in multivariate stratified random sampling by taking into account the randomness 
of the set of estimated variances for each stratum. 

The theory of Stochastic Mathematical Programming or Stochastic Optimisation is a 
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well established field of study that deals with the following general problem: 



mm 



subject to 
^(x,O>0, 3 = 1,2... 



(1) 



where x is fc-dimensional and £ is m-dimensional. If x or £ are rando m, then (E D) de- 
fines a multi objective stochastic m a them atical programming problem, see Prekopa (|1995 . 
p. 234) and Uryasev and Pardalosl ( 2001 ). As shall be seen in the following sections, the 
optimum allocation problem in stratified random sampling can be proposed as a stochastic 
multiobjective mathematical programming problem. 

This paper formulates the optimum allocation in multivariate stratified random sam- 
pling as a stochastic multiobjective integer mathematical programming problem constrained 
by a cost function or by a given sample size. Section [2] includes some notation and defini- 
tions on multivariate stratified random sampling under two approaches. Section [3] studies 
the asymptotic normality of the vector of sample variances. Two alternative approaches 
for solving the integer stochastic multiobjective mathematical programming problems are 
proposed in Section HI In Section El under the first approach, diverse particular stochastic 
solutions are proposed. Similarly, several particular solutions are derived under the second 
suggested approach, see Section El Section [7] establish some equivalent deterministic pro- 
grams to the stochastic solution stated in Section [6j Finally the techniques proposed are 
applied to an example of the literature, see Section El 



2 Preliminary results on multivariate stratified random sam- 
pling 

Assume a population of size N, divided into H sub-populations (strata). We wish to find 
a representative sample of size n and an optimum allocation in the strata, meeting the 
following requirements: i) to minimise the variance of the estimated mean subject to a 
budgetary constraint; or ii) to minimise the cost subject to a constraint on the variances; 
this is the clas si cal problem in op t imum allo cation in un i variat e stratified sampling, see 
Cochran ( 1977 ). Sukhatme et al. ( 19841 ) and Thompson (1997). However, if more than 
one characteristic (variable) is being considered then the problem is known as optimum 
allocation in multivariate stratified sampling. For a formal expression of the problem of 
optimum allocation in stratified sampling, consider the following notation. 

The subindex h = 1,2, ••• , H denotes the stratum, i = 1,2, ••■ ,Nf t or the unit 
within stratum h and j = 1, 2, • • • , G denotes the characteristic (variable). Moreover: 



Nh Total number of units within stratum h 

rih Number of units from the sample in stratum h 

Vhi 

of the j-th characteristic 
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n = 

w h 


(ni, ■■■ , 
N h 
N 


n H )' 


n 


N h 
i=l 

Nh 




Y h 


= (*!•■ 


■ ,Y G h y 


vi- 


«h 

i=l 




rih 




y h = 




,€)' 


v j 

y ST 


h=l 


V 

J ST 


{y ST > 


' y st 


c-2 







Vector of the number of units in the sample 
Relative size of stratum h 



Population mean in stratum h of the j-th characteristic 
Population mean vector in stratum h 

Sample mean in stratum h of the j-th characteristic 

Sample mean vector in stratum h 

Estimator of the population mean in multivariate 
stratified sampling for the j-th characteristic 

Estimator of the population mean vector in 
multivariate stratified sampling 
Population variance in stratum h 



i — 1 

N~ h 



s\j Sample variance in stratum h 

^2(vL - vif 



„2 _ trf 

§h = (S^i, ■ ■ ■ , Sl G ) Vector of population variances in stratum h 
s h = (shi, ■ ■ ■ , s ha)' Vector of variance estimators in stratum h 
Var(y^ T ) Estimated variance of y 3 ST 



A w, 

h=l 



N 

Ch Cost per G-dimensional sampling unit in stratum h and let 

c = (ci, • • • , cq)' . 



Define V u (y ST ) G K G , as V u (y ST ) = (Var(^ T ), • • ■ ,Var(y G )J , where if a € K G , a' 

denotes the transpose of a. Finally, observe that V u (y ST ) is a function of n, then it 
shall be written either V u (y ST ) or V u (y ST (n)). Similarly, it shall be written Var(y^, T ) or 
Var(^ T (n)). 

3 Limiting distribution of the sample variance 

In this section the asymptotic distribution of the estimator of the vector of variances Sh is 
studied. With this aim in mind, th e mult i variat e version of Hajek's theorem in the context 



of sampling theory is presented, see lHajeld (Il96ll ). In what follows, from Lemma [3 . 1 1 through 



Theorem 13. 1\ asymptotic results are stated for a single stratum. The notation N v and n v 
denote the size of a generic stratum and the size of a simple random sample from that 
stratum. 
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Lemma 3.1. Let q v be a G x 1 random vector defined as 



Qv 



Qv 



n, 



;\ 2 



(2) 



j=l,...,G 



Assume that for A = (Ai, . . . , Ac)', any vector of constants, 



X' (M$ - S V S' ) A > e max 



\]e> G (Mi - §X) 4 



(3) 



where eP G = (0, . . . , 0, 1, 0, . . . , 0)' is the j-th vector of the canonical base of e > and 
independent of u > 1 and 



M 



1 ( Nv —k 

\j=i 



A 2 / _i \ 2 

tJvi ± v 



/ k,l=l,...,G 

Suppose that n u — > oo, N v — n v — > oo, N u — > oo, and that, for all j = 1, . . . , G, 



lim 

v— >oo 



> lim 

v— >oo 



max > 

l<ii<-<i„„<iV„ f-^ 



, \ 2 



t>vi n v j ^vj 



(4) 



where 



m 



^ N, 



i=i 



yli - vi , 



Then, q u is asymptotically normally distributed as 

Qv -^Ng(E(q u ),Cov(q„)), 

with 



E(ff„) 



n v -l 



and 



Cov(q v 



n v is the sample size for a simple random sample from the v-th population of size N u . 

, . .%2 1 il 1 

Remark 3.1. Let g u as in ([2]), taking m = G and a v i = []/ ui — Y u ) in iHaiekl ( 196ll ). it is 
obtained that: 



(5) 
(6) 



i) q v can be expressed as 



^ K%a V R vi 



j=l,...,G 
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with b's fixed, furthermore b v \ 
Then 

max (b V j — by) 
lim 1 ~ J ~ jV " = 0, where b 

E(^-^) 2 



Kn v = o. 



i=l 



i=l 



holds if n v — > oo, iVj, — — > oo. 
ii) is 



-Y 



8=1 



N E ^ 



8=1 



o2 



iii) From (7.2) in lHaiekl (|l96lh 



E 

i=l 



i=i 



> e max 
i<i<fc 



i=l 



a- 3 ') 2 



(7) 



In the context of sampling theory, the right side in ([7J) can be written as 

2 



E 

8=1 



Em 



i=i h=i 



• - Y 3 



c2 



A',, 



8=1 



o2 

°!/l 



c2 



N v 

8=1 



JV V A'[ 



c2 



1 



.G TfG 



Si Si 



m u kl ^fc^J fc i i=i i ... iG A 



N„\' {Mi - SX) 



(8) 



where M?, is 



1 

A 7 ^ 



J k,l=l,...,G 



(9) 
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Similarly the right side of © is 



i=l 



Then, proceeding as in 3., 



N v 



N v 
i=l 

1=1 



e G e G 



-G 



v 3 -Y 3 



S 



Kl 



Kl 



(10) 



8=1 



Therefore, from (JBJ) and ([10]) . ([3]) is established. 

iv) The expression for ([3]) is found analogous to the procedure described in item 3. 

v) Finally, 



E((?) 



1 



n,. 



n,. 



v 3 ■ - Y 3 



i=i,...,G 



i=l,...,G 



\i=l / 



j=l,...,G 



n v — 1 



Similarly, by independence 
Cov(e) 



' \i=l / 3 =i 



J=1,...,G 



K - 1) 2 



E 



j=l,...,G V«=l / j=l,...,G 



E 



k i=l 



i=l G \»=1 



j = l,...,G. 



(n„ - l) 2 

where is defined in ([9]). 

Theorem 3.1. Under assumptions in Lemma l3.il i/ie sequence of vector of sample vari- 
ances s u are such that s u has an asymptotical normal with asymptotic mean and covariance 
matrix given by |5|) and (GJ), respectively. 
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Proof. This follows immediately from Lemma 13. 11 only observe that 



n. 



1 ( nv 



n,. 



where 



n,. 



n,. 



1 and (yl — Y J U ) 2 — > in probability, j = 1, . . . , G. 



2 



i v« - ^ 



j=l,...,G 



Tli. 



□ 



□ 



As a direct consequence of Theorem 13,11 it is obtained: 
Theorem 3.2. Let V u (y ST ) be the estimator of the vector of variances ofy ST , then 

H 

( w h - 

n h N 



v u {y ST ) = 2^ ( — I 



h=l 



is asymptotically normal distributed; moreover 



v u (y qT ) 4 N k ( E ( V u (y) ) , Cov ( y u (y 



ST 



where 



H 



E[V u (y ST 
Cov(K(y S3 



h=i v ft 



and 



M 4 , 



1 



h=l 

N h 



W h 2 W h \ n h 



^ \ n h N J {n h - 1) 



AT 7 n ft - 1 

(Mt-S h S' h ) 



(11) 
(12) 



8=1 



k,l=l,...,G 



n u is the sample size for a simple random sample from the v-th population of size N u . 

Observe that the asymptotic means and covariance matrices of the asymptotic normal 
distributio ns of Sh and V lt (y ST ) are in terms of the population parameters Y^, S^, and M^; 
then, from Rao ( 19731 ). approximations of the asymptotic distribution are obtained making 
the following substitutions 



§ h -)■ s h , and M. 



where 



1 f nh ( 



m, 



2 ' l -l^ 2 

Vhi ~ Vh , 



(13) 



k,l=l,...,G 
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4 Optimum allocation in multivariate stratified random sam- 
pling via multiobjective stochastic mathematical program- 
ming 

When the variances are the objective functions, subject to certain cost function, the opti- 
mum allocation in multivariate stratified random sampling can be expressed as the following 
deterministic multiobjective mathematical programming 



min V u (y ) 



mm 

n 



/ Var(yM \ 



V Var(y« ) / 



subject to 
c'n + Co = C 
2<n h <N h , h = l,2, 
n h 6 N, 



(14) 



which has been studied in detail by iDfaz-Garcia and Ulloa (2008). 

Observing that V u (y ) is a random variable then, as in (pQ), the optimum allocation 
via stochastic mathematical programming can be stated as the following stochastic multi- 
objective mathematical programming problem 



/ Var(yi 



min V u (y ) 



mm 

n 



ST' 



V Vax(yf_ 



subject to 
c'n + Co = C 
2<n h <N h , h = l,2,...,H 

V4y ST ) 4 Ng (e (v u (y ST )) , Cov (v u (y ST ; 
n h e N, 



(15) 



where E ( V u (y )) and Cov (V w (y )) are given by (fTT|) and (fT2|) respectively. 



In the statistical context, two approaches can be proposed to solve problems of stochastic 
multiobjective mathematical programming of the type (|15p : 



1. 



First, proposing a solution to the multiobjective mathematical programming problem, 
by converting it to an uniobjective mathematical programming problem. Then this 
uniobjective mathematical programming problem is solved as a stochastic program- 
ming problem by applying any of the techn iques in the area. Tlri s ap proach has been 



mmg problem Dy applying any oi tne tecn mques m tne area, - i nis ap proacn nas De( 
applied in other areas of the Statistics, see Khuri and Cornell ( 19871 ) among others 



2. Given that methodologies exist, problems of the type (115]) can be solved by applying 
directly the techniques of stochastic multiobjective mathematical programming, see 
Stancu-Minasian ( 19841 ) among others. 
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5 First approach 

For this ap proach, the literature of multi o bjective mathematical programm i ng off e rs plenty 
of too ls, see Diaz-Garcia and Ulloa ( 20081 ). Rfos. Rios Insua and Rfos Insua ( 19891 ). Miettinen 
(|l999h and lSteueri ()1986l ). Here, the techniques of value function and goal programming are 
considered. Under the value function approach, (|15p is formulated as 



mm 



V M (y c 



(16) 



ST' 

subject to 
c'n + Co = C 
2<n h <N h , h = l,2,...,H 

V u (y ST ) 4 N G (e (v u (y ST )) , Cov (v u (y, 

n h G N, 

where (/>(•) is a value function^ that summarises the importance of each of the variances of 
the G characteristics. 

Similarly, in terms of goal programming (I15p is stated as 



min -F(n, §) = min > Wjid^ + d- ) 
subject to 

Var(^ ST (n)) + d+ -dj=t J , j = 1, . . . , G, 
c'n + Co = C 
2<n h <N h , h = l,2,...,H 

V u (y ST ) 4 N G (e (v u (y ST )) , Cov (v u (y ST ) 

n h G N, 



(17) 



with 



1 / 



( Var(^ ST (n)) - t, + (Var(^ ST (n)) - t 



i 2 

dj = l - (|va^ ST (n)) -tj\- (Var(y{ T (n)) 

where the tj's are the target values for the objectives Var(y^ T ), 

Note that, now, (|16p and (|17p are stochastic uniobjective mathematical programming 
problems, then, any technique from the area of stochastic programming could be applied. 
For example: 



1,...,G. 



5.1 Modified expected value solution, i?-model and F-model 

Point n G ~H H is the expected modified value solution to (I16p if it is an efficient solution 
m the ParetcH sense to the following deterministic uniobjetive mathematical programming 

: A value function is a function <j) : ?R H — > K such that minV u (y sr (n*)) < minV u (y ST (ni)) 
0(V u (y ST (n*))) < 0(V„(y sr (m))), n* + m. 

2 For matrix mathematical programming problems, there rarely exists a point f*(n) which can be con- 
sidered as the minimum. Alternatively, it said that f*(n) is a Pareto point of f (n) = (/i(n), . . . , fa(n))', if 
there is no other point f 1 (n*) such that f 1 (n) < f*(n), i.e. for all j, /^(n 1 ) < /j(n*) and f 1 (n) 7^ f*(n). 
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problem 



min k\ E 

n 



V u (y ST )j)+A; 2 ^Var 

subject to 
c'n + co = C 
2<n h <N h , h = l,2,. 
n h £ N, 



(18) 



H 



and for (fTTD 



3 = 1, 



H 



G, 



(19) 



min h E (F(n, S)) + A; 2 VVar(F(n,S)) 

n 

subject to 
V^(yi T (n)) + d+-dJ=tj, 
c'n + co = C 
2<n h <N h , h = l,2, 
n h G N, 

Here k\ and & 2 are non negative constants, and their values show the relative importance 

of the expectation and the variance of <MV u (y )) and F(n, 8). Some authors suggest 

1 J 7\ / 

1, see iRaol (j 19791 . p. 599). Observe that if we take k± = 1 and fc 2 = in 

the resulting method is known as the E-model. Alterna t ively, i f fci =0 a nd ko = 1, 

the method is termed the V -model, see ICharnes and Cooper! (119631 ). IPrekopal (119951 ) and 



that k\ + /c 2 

(us 



Urvasev and Pardalosl (|200ll ) 



5.2 Minimum risk solution of aspiration level r, P-model 

Point n E N H is a minimum risk solution at the aspiration level r to the problem (I16p (also 
termed P-model by ICharnes and Cooper! (|l963l )) if it is an efficient solution in the Pareto 
sense of the uniobjetive stochastic mathematical programming problem 



minP(0(V u (y ST )J <r 

subject to 
c'n + co = C 
2<n h <N h , h = l,2,... 
n h G N. 



(20) 



II 



and correspondingly, for (j 1 T P : 



minP(F(n,S) < n) 

n 

subject to 

Var(y^(n)) + d+ -dj=t j , j = l,..., G, 
c'n + co = C 
2<n h <N h , h = l,2,...,H 

n h G N, 



(21) 



for an specified aspiration level t\. 

Among the multiobjective techniques we find that the value function method is, in 
general, the most commonly app l ied, i ts properties have b een studied w ith most detail, see 
Rios. Rios Insua and Rfos Insual ^fl989l ) . iMiettinenl (Il999h . lsteuerl (|l98fih . and the references 
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therein. Note that the value function (fr(-) can be defined in an infinite number of forms, 
which represents a great obstacle for its definition. Fortunately, some simple functions have 
given excellent results in applications and they can be considered as promising approaches. 
One of these particular forms is the weighting method. Under this approach, problem (|16p 
can be expressed as: 

G 

min ^^Var(y J ST ), 
i=i 

subject to 

c'n + c = C (22) 
2<n h <N h , h = l,2,...,H 

V u (y gT ) A A4 (E (v„(y OT )) ,Cov (v„(y ST ))) 
n h G N, 

G 

such that Wj = 1, Wj > V j = 1, 2, . . . , G; where the u^-'s weight the importance 
of each characteristic. 



6 Second approach: Stochastic multiobjective mathematical 
programming approaches 



In this section, solutions for problem (|14p are proposed under diverse stochastic multiobjec- 
tive mathematical programming approaches. The properties of th e solution obtained unde r 
the different a pproa ches are described in detail by Kataoka ( 1963 ). Stancu-Minasian ( 19841 ) 
and lPrekopal (jl995l V 



As shall be seen, each stochastic multiobjective mathematical programming approach 
can be stated in several ways. In some cases, these possibilities are a consequence of 
assuming whether Var(y^ T ) j = 1, . . . , G, are correlated or not. 

6.1 Multiobjetive expected value solution, multiobjetive E-model 

Point n € N G is the expected value solution to (|14p if it is an efficient solution in the Pareto 
sense to the following deterministic multiobjective mathematical programming problem 

min E (v u (y ST )), 
subject to 

c'n + c = C ( 23 ) 
2<n h <N h , h = l,2,...,H 
n h e N. 

6.2 Multiobjetive minimum variance solution, multiobjetive V-model 

The point n G N G is the minimum variance solution to the problem (|14p if it is an efficient 
solution in the Pareto sense of the deterministic multiobjetive mathematical programming 
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problem 



mm 



Var 



Var(y^ T ) 



\ 



(24) 



11 

, Var (Var(y^ T ) j j 

subject to 
c'n + Co = C 
2<n h <N h , h = l,2,...,H 

n h G N, 

This efficient solution is adequate if it is assumed that Var(y^ T ), j = 1, . . . , G are uncorre- 
cted, however if they are correlated then a more adequate approach is: 

The point n G N G is the minimum variance solution to the problem (|14p if it is an 
efficient solution in the Pareto sense of the deterministic matrix mathematical programming 
problem 

min Cov (v u (y gT )) 

subject to 
c'n + Co = C 
2<n h <N h , h = l,2, 
n h G N. 

A general approach is studied by lDiaz Garcia and Ramos-Quiroga f|200lh . 



(25) 



H 



6.3 Multiobjetive expected value standard deviation solution, multiob- 
jetive modified E-model 

Point n G N G is an expected value standard deviation solution to the problem (|14p if it 
is an efficient solution in the Pareto sense of the mixed deterministic multiobjetive-matrix 
mathematical programming problem 



mm 

n 



E V u (y £ 



Cov 



V u (y ST 

subject to 
c'n + Co = C 

2<n h <N h , h = l,2 
n h G N. 



1/2 



(26) 



II 



where (A 1 / 2 ) = A, see llVIuirheadl (jl982j . Appendix). 

We now define the concept of efficient solution multiobjetive minimum risk of joint 
aspiration level r = (ti, T2, . . . , tq)' and the efficient solution with joint probability a. Both 
solutions are obtained, respectively, by applying the multivariate versions of minimum risk 
and Kataoka criteria, referred to in the literature as criteria of maximum probability or 
satisfying criteria, due to the fact that, as we shall see, in both cases the crite ria to be 



used p rovide, in one way or another, "good" solutions in terms of probability, see iKataoka 
(Il96 
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6.4 Multiobjetive minimum risk solution of joint aspiration level r, mul- 
tiobjetive modified P-model 

Point n € N G is a minimum risk solution at joint aspiration level r to the problem (|14|) if 
it is an efficient solution in the Pareto sense of the multiobjetive stochastic mathematical 
programming problem 

/ P (Var(yi ) < \ 



mm 



^ P (Var(y 



G ' 

ST' 



subject to 
c'n + Co = C 
2<n h <N h , h = l,2,.. 
n h 6 N. 



(27) 



H 



It is also possible consider the follow alternative multiobjective P-model 

/ Var(yi ) < n 



min P 

n 



V Var(y 



ST- 



subject to 
c'n + Co = C 
2<n h <N h , h = l,2,.. 
n h e N. 



(28) 



H 



Again, (j28|) is more adequate if the response variables are co rrelated. Howe ver (|28p is 
sensibly more complicated to solve that (|27|) . When G = 2, Prekopa (jl970l ) proposed 
an algorithm in a similar problem (probabilistic constrained programming), which can be 
apply to solve (f28|h 



6.5 Multiobjetive Kataoka solution with probability a 

Point n £ N G is a multiobjetive Kataoka solution with probability a (fixed) to the problem 
(|14p if it is an efficient solution in the Pareto sense of the multiobjetive mathematical 
programming problem 

min r 

n,T 

subject to 

P (Var(F ST ) < t 3 ) = a, j = 1, . . . , G (29) 

c'n + Co = C 
2<n h <N h , h = l,2,...,H 
n h G N. 



14 



Alternatively (J29]) can be proposed as 



mm r 

n.T 

subject to 



V Var(y« ) < r r 
c'n + cq = C 
2<n h <N h , h = l 
n h G N. 



= a 



(30) 



Note t h at (1291) and (1301) are mult iobje ctive probabilis tic constrained programs, see lCharnes and Cooper 
dl963l V IStancu-Minasian l ij 19841 ) and lPrekopal <jl9flfj >. 



Stancu-Minasian 



Many others approaches can be used to solve (|14p . For example, 

propose a stochastic version of a sequential technique termed Lexicographic method, 
for solving (I24D and (|27h . or to applying it directly to (|14H : among many others options. 



7 Equivalent deterministic programs 



In this section we study in detail several particular deterministic programs equivalent to 
the stochastic multiob jective program (I15p . Consider first the following remark: from (jllj) . 
(Hl|), (USD andi = l,...,G, 



E(Var(l4))=£ 



7i=l 



N J n h - 1 



Var (Var(^ ST )j = 



W h 2 W h 



n h N J (n h - 1) 



Jy2 ( m hj ~ ( S hj) 2 ) i 



and 



^4 

m hj 



1 71,1 / 

a i=l 



7.1 Multiobjective V- model 

From (|24p the multiobjective deterministic problem equivalent to (|15p via the V- model is 



mm 



Var Var (yM 



(31) 



y Var (Var(y^ T )J J 

subject to 
c'n + Co = C 
2<n h <N h , h = l,2,...,H 
n h G N. 

Which is solved by applying any of the multiobjective mathematical programming tech- 
niques. 
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7.2 Multiobjective P-model 

Proceeding as in Diaz Garcia et al. (|2005h . the multiobjective deterministic problem equiv- 



alent to (1151) via the P-model ([2 



is 



mm 

n 



( n - E (Var(yl T )) ^ 
Va7 (VaJ(yi T ) 

r G - E (Var(y 



'ST- 



\ 



Var (Var(y G T ) 

subject to 
c'n + Co = C 
2<n h <N h , h = l,2,. 
n h G N. 



(32) 



H 



7.3 Multiobjective Kataoka model 



From lDiaz Garcia et al\ (]2005l ). the multiobjective deterministic problem equivalent to ([15 
via the Kataoka model (129j) is given by 



' E (Vaifc)) + ^(5) J Vac (Vax(y l ST ^ 



mm 

n 



V 



E (Var(^ T )J + fc-^a) WVar (Var(l£ T ) 



(33) 



subject to 
c'n + Co = C 
2<n h <N h , h = l,2, 
n h e N. 



H 



where $ denotes the distribution function of the standard Normal distribution. 

Similar multiobjective deterministic problems equivalent to (|15h are obtained applying 
the other stochastic solutions described in Section^ Note that, if we combine each stochas- 
tic solution with each multiobjective optmisation technique, we obtain an infinite number 
of possible solutions of (|15p . For example, note that the function of value /(•) may take 
an infinite number of forms. One of t hese particu l ar for ms is the weighting method, see 
Rfos. Rios Insua and Rios Insua ( 19891 ) and Steuer ( 19861 ). Under this approach, problem 
(l33l) can be solved as: 



G 



min <E \Yai{y j ST )J + $ 1 (S) J Var (Var(y 




subject to 
c'n + Co = C 
2<n h <N h , ^ = 1,2, 
n h G N. 



(34) 



H 
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such that Ylf=i w j = 1, tUj > V j = 1, 2, . . . , G: where uij weights the importance of each 
characteristic. This solution can be termed the multiobjetive Kataoka-weighting solution 
with probability a equivalent to the problem (|15|) . via the weighting method. 

Similarly, the multiobjetive Kataoka solution with probability a to the problem (|14p . 
via the goal programming is 

G 

min ^2 w j {dj + dj ) 
subject to 



E (Var(^ T )j + <D-i(J) ^Var (Var(F ST )J - d} + dj = Tji j = 1, . 

c'n + Co = C 
2<n h <N h , h = l,2,...,H 
n h € N. 



-,G, 



(35) 



where 



E (Vax(£> T )J + ^(J) WVar (Var(y- 



' ST' 



+ E ( Var(^ ST )J + WVar (Var(F ST ) ) - r- 




E (Var(^)J + $- x ((5) WVar (Var(y- 



1 ST' 



E ( Var(^ ST ) ) + r^) WVar (Var(F ST ) 



Note that so far, the cost constraint c^n^ + cq = C has been used in every stochastic 



h=l 



multiobjective mathematical programming method. However, in several situations, it can 
be used to represent existing restrictions for the availability of man-hours for carrying out a 
survey, or restrictions on the total available time for perfor ming the survey, etc. These c ases 
can be implemented by using the following constraint, see Arthanari and Dodge ( 198ll ): 



H 

h=l 



n. 



8 Application 



Consider data from a case study presented bv lArvanitis and Afonia (|l97lh . They study a 
forest survey conducted in Humbolt County, California. The population was subdivided 
into nine strata on the basis of the timber volume per unit area, as determined from aerial 
photographs. The two variables included in this example are the basal areaEI, (BA), in 
square feet, and the net volume in cubic feet (Vol.), both expressed on a per acre basis. 
The variances, covariances and the number of units within stratum h are listed in Table 1. 



In forestry terminology, 'Basal area' is the area of a plant perpendicular to the longitudinal axis of a 
tree at 4.5 feet above ground. 
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Table 1: Variances, covariances and the number of units within each stratum 



Variance 



Stratum 


N h 


BA 


Vol. 


Covariance 


1 


11 131 


1 557 


554 830 


28 980 


2 


65 857 


3 575 


1 430 600 


61 591 


3 


106 936 


3 163 


1 997 100 


72 369 


4 


72 872 


6 095 


5 587 900 


166 120 


5 


78 260 


10 470 


10 603 000 


293 960 


6 


51 401 


8 406 


15 828 000 


357 300 


7 


24 050 


20 115 


26 643 000 


663 300 


8 


46 113 


9 718 


13 603 000 


346 810 


9 


102 985 


2 478 


1 061 800 


39 872 



In this example, the stochastic multiobjective mathematical programming problem un- 
der approach (fT5|) is 



nun 



Var<y 



n" y Vax(y 2 ST ) J 
subject to 

9 

^n h = 1000 

h=l 

V u (y ST ) 4 N 2 (E (V u (y)) , Cov ( V u (y ST ) 



(36) 



ST' \^ l ■ u\j ST 

2<n h <N h , h=l,2,..., 
n h G N, 



where E ( V u (y ) I and Cov ( V u (y ) I are given by (fTTI) and (fT2j) respectively. 



8.1 Particular solutions under the first approach 



Taking into account (|TTj) , (|T2|) and (|T3j) , and from (|T8j) and (|22j) the equivalent deterministic 
problem to (|36|) is 



mm < 

n 



k\ ^2 WjE Var(y 



~,3 ^ 

ST ' 



+k 2 



J2 w jV &T Var(yJ T ) + 2 Wl w 2 Cov (Var^), Var(y^) 
i=i 

subject to 

9 

^2 n h = iooo 

h=l 

2<n h <N h , /i = l,2,...,9 

n h G N, 

where w\ + w 2 = 1, and k\ + k 2 = 1 , rj (fixed) > V j = 1,2, whose solution is 
termed: the weighting-modified E— model solution. 
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Similarly, from (|2ip . the weighting-P— model solution of (|36p is obtained solving the 
following equivalent deterministic problem 



mm 



r — ^2 w jE ~V&r(y- 



ST' 



^w|Var Var(y^ T ) + 2w 1 W2Cov ^Var(y* T ), Var(y 

subject to 

9 

^2 n h = 1000 

/i=i 

2<n h <N h , h = l,2,...,9 

n h G N, 



-2 ' 

ST- 



8.2 Particular solutions via the second approach 

From Section [71 using the weighting method, the multiobjective V —model-weighting model 
solution of (|36p is obtained from the following equivalent deterministic problem 



min ^2 w j { Var ( Var (yi; T )) } 



subject to 

9 

^n h = 1000 



2<n h <N h , h = l,2,...,9 
n h G N. 

Analogously, the multiobjective P— model-weighting model solution of (f36j) is obtained by 
solving the following equivalent deterministic problem 



min Wj 



E (Var(yi 2 



i=i 



Var 



(^Var(y- 

subject to 
9 

^n h = 1000 



h=l 



2<n h <N h , h = l,2,...,\ 
n h G N. 



Similarly, solving the following equivalent deterministic problem, the multiobjetive Kataoka- 
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weighting solution of (|36p is obtained 



mm J2wj |g (Var(yj r )) + Of' 1 (8) ^Var (var(F ST )) | 



subject to 

9 

^ n h = 1000 

/i=i 

2<n h <N h , h = l,2,...,\ 
n h G N. 



In all cases, 5Zj=i w j = w j > V j = 1, 2: where the Wj's weight the importance of each 
characteristic. 

Table 2 shows the solutions obtained by some of the methods described in Sections [5l in 
particular, the weighting-modified E— model, weighting- E— model and weighting-!/— model 
are presented. Also, the solutions described in Section are included, specifically the mul- 
tiobjective V— model-weighting, P— model-weighting and Kataoka model-weighting models. 
The first two rows in Table 2 include the optimum allocation for each characteristic, BA 
and Vol. The last two columns show the minimum values of the individual variances for 
the respective optimum allocations identified by each method. The results were com puted 
using the commercial software Hyper LINGO/PC, release 6.0, see IWinston (jl995h . The 



default mathematical programming methods used by LINGO to solve the nonlinear inte- 
ger mathematical programmin g programs are Gene ralised Reduced Gradient (GRG) and 
branch-and-bound methods, see Bazaraa et al. ( 20061 ). Some technical details of the compu- 



tations are the following: The maximum number of iterations of the methods presented in 
Table 2 was 2364 (multiobjective Kataoka-weighting solution) and the mean execution time 
for all the programs was 2 seconds. Finally, observe that the greatest discrepancy found 
by the different methods among the sizes of the strata occurred under the multiobjective 
P— model-weighting solution. In part, this discrepancy is a consequence of the particular 
values for t\ and T2 in this method. 



Conclusions 

It is important to stress that there is a potentially infinite number of possible solutions to 
a stochastic multiobjective mathematical programming problem. Simply note that such an 
infinite number due to the flexibility in choosing the value function. Therefore, this paper 
presents only some few techniques of the area of multiobjective, stochastic and stochastic 
multiobjective mathematical programming, which when combined, produce a number of 
possible solutions to the problem of optimum allocation in multivariate stratified random 
sampling from a stochastic multiobjective mathematical programming point of view. 

Because of all these possibilities, it is difficult to set general rules for how to use these 
techniques, leaving all responsibility on the skill of an expert in the area to determine which 
of these approaches is best in a particular application. 

As the reader can see, several solutions are given in the context of the application, and, 
for the sake of complet e ness, the solutions in terms of goal programming are only indicated, 
see Kazemzadeh et al. ( 20081 ). 
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Table 2: Sample sizes and estimator of variances for the different allocations calculated 



Allocation 


ni 


W2 


m n4 ns ne 


n 7 


ng 


ng 


Var(yi T ) 


Va?& 2 ST ) 


BA 


10 


94 


144 136 191 113 


81 


109 


122 


5.591 


5441.105 


Vol 


7 


62 


119 136 200 161 
First approach" 


98 


134 


83 


5.953 


5139.531 


Weighting-modified E— model 


8 


16 


77 119 191 ~91 


158 


161 


49 


7.311 


5593.494 


Weighting- E — model 


7 


63 


119 135 200 160 


98 


134 


81 


5.936 


5139.645 


Weighting- V —model 


8 


16 


77 119 191 121 
Second approach' 


158 


161 


19 


7.526 


5997.963 


Multiobjctivc 


















V— model- weighting model 


8 


16 


77 119 191 191 


158 


161 


19 


7.311 


5593.494 


Multiobjetive 


















P— model-weighting modcl c 


247 


12 


34 77 127 106 


221 


117 


29 


11.817 


8980.960 



Multiobjetive 

Kataoka-wcighting modcl^ 8 46 77 119 191 191 158 161 49 7.311 5593.494 



"With fci = k 2 = 0.5 and Wl = w 2 = 0.5. 

6 Whcre w\ = 11)2 = 0.5. 

c Where t\ = 6 and ri = 6000. 

d With $ _1 (<5) = 1.645 
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